Intelligent Cooperative Control Architecture: A Framework for Performance Improvement Using Safe Learning
نویسندگان
چکیده
Planning for multi-agent systems such as task assignment for teams of limited-fuel unmanned aerial vehicles (UAVs) is challenging due to uncertainties in the assumed models and the very large size of the planning space. Researchers have developed fast cooperative planners based on simple models (e.g., linear and deterministic dynamics), yet inaccuracies in assumed models will impact the resulting performance. Learning techniques are capable of adapting the model and providing better policies asymptotically compared to cooperative planners, yet they often violate the safety conditions of the system due to their exploratory nature. Moreover they frequently require an impractically large number of interactions to perform well. This paper introduces the intelligent Cooperative Control Architecture (iCCA) as a framework for combining cooperative planners and reinforcement learning techniques. iCCA improves the policy of the cooperative planner, while reduces the risk and sample complexity of the learner. Empirical results in gridworld and task assignment for fuel-limited UAV domains with problem sizes up to 9 billion state-action pairs verify the advantage of iCCA over pure learning and planning strategies.
منابع مشابه
A New Intelligent Approach to Patient-cooperative Control of Rehabilitation Robots
This paper presents a new intelligent method to control rehabilitation robots by mainly considering reactions of patient instead of doing a repetitive preprogrammed movement. It generates a general reference trajectory based on different reactions of patient during therapy. Three main reactions has been identified and included in reference trajectory: small variations, force shocks in a single ...
متن کاملActor-Critic Policy Learning in Cooperative Planning
In this paper, we introduce a method for learning and adapting cooperative control strategies in real-time stochastic domains. Our framework is an instance of the intelligent cooperative control architecture (iCCA). The agent starts by following the “safe” plan calculated by the planning module and incrementally adapting its policy to maximize the cumulative rewards. Actor-critic and consensusb...
متن کاملCirca: the Cooperative Intelligent Real-time Control Architecture Circa: the Cooperative Intelligent Real-time Control Architecture Table of Contents
CIRCA: THE COOPERATIVE INTELLIGENT REAL-TIME CONTROL ARCHITECTURE by David John Musliner Co-Chairs: Kang G. Shin and Edmund H. Durfee The Cooperative Intelligent Real-time Control Architecture (CIRCA) is a novel architecture for intelligent real-time control that can guarantee to meet hard deadlines while still using unpredictable, unrestricted AI methods. CIRCA includes a real-time subsystem u...
متن کاملEmotional Learning Based Intelligent Controller for MIMO Peripheral Milling Process
During the milling process, one of the most important factors in reducing tool life expectancy and quality of workpiece is the chattering phenomenon due to self-excitation. The milling process is considered as a MIMO strongly coupled nonlinear plant with time delay terms in cutting forces. We stabilize the plant using two independent Emotional Learning-based Intelligent Controller (ELIC) in par...
متن کاملThe Impact of Cooperative Learning and Mobile Learning through Bluetooth Device on Vocabulary Learning of Iranian EFL Learners
Cooperative learning has been found to affect different aspects of language learning by many researchers (e.g., Kagan, 1995; Kagan, 1999; Kessler, 1992; McGroarty, 1993). Likewise, mobile assisted language learning (MALL) has revealed significant impacts on the improvement of different language skills and components (e.g., Comas-Quinn et al. 2009; Divitini & Chabert, 2009; Motallebzadeh & Ganja...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Journal of Intelligent and Robotic Systems
دوره 72 شماره
صفحات -
تاریخ انتشار 2013